On The Classification Of Indic Languages

Author

  • Subhash Kak
Abstract

Language, as part of human expression, may be viewed in analogy with genetic expression. The evolution of language is the result of complex temporal and spatial processes; if one could aggregate these processes, one could speak in terms of parent traits and the resultant descendant traits. Insights from the theory of non-linear dynamics indicate that the multitude of interactions amongst speakers would lead to the formation of just a few languages. Strongly interacting systems of very many components, like assemblies of neurons or human speakers, have only a few stable interaction states, called attractors, associated with their behaviour,¹ and these, for speakers, are the various languages. In evolving systems, the nature of these stable states will also change. This is how isolated languages can be seen to change. But more significant than this process is the change due to interaction with other languages. Against this background, it is clear that language evolution is correctly viewed within the framework of other interacting languages. Yet for about one and a half centuries, language evolution has been studied using models inspired by early, mechanistic physics. Like a physical system that evolves due to radiation and other incident forces, languages were taken to change spontaneously. The spread of languages was explained by another mechanistic metaphor, namely that of population transfers and invasions. This led to models of language families. The German philologist August Schleicher pioneered the tree approach in the 1860s. It assumes that when populations are isolated, their speech becomes increasingly differentiated until the varieties become distinct languages; this assumption allows one to set

Similar resources

A Comprehensive Analysis of Stemmers Available for Indic Languages

Stemming is the process of term conflation. It conflates all the variants of a word to a common form called the stem. It plays a significant role in numerous Natural Language Processing (NLP) applications such as morphological analysis, parsing, document summarization, text classification, part-of-speech tagging, question-answering systems, machine translation, word sense disambiguation, information retrie...

Lexical Semantics and Selection of TAM in Bantu Languages: A Case of Semantic Classification of Kiswahili Verbs

The existing literature on Bantu verbal semantics has demonstrated that the inherent semantic content of verbs pairs directly with the selection of tense, aspect and modality (TAM) formatives in Bantu languages like Chasu, Lucazi, Lusamia, and Shiyeyi. Thus, the gist of this paper is the articulation of a semantic classification of verbs in Kiswahili based on the selection of TAM types. This is because the sem...

Indica, an Indic preprocessor for TeX: A Sinhalese TeX System

In this paper a two-fold project is described: the first part is a generalized preprocessor for Indic scripts (scripts of languages currently spoken in India, with the exception of Urdu, plus Sanskrit and Tibetan), with several kinds of input (LaTeX commands, 7-bit ASCII, CSX, ISO/IEC 10646/Unicode) and TeX output. This utility is written in standard Flex (the GNU version of Lex), and hence can be painlessly compil...

A Text Input Scheme for Indic Languages with Large Numbers of Printable Characters

This paper discusses the design and development of a text-input scheme for phonetic Brahmic languages with a large number of printable characters. We devise an input scheme for an exemplar Indic language with the understanding that the findings are generalizable to other Indic languages. Our results show that a casual user is able to type at a reasonable speed with our approach.

The Festvox Indic Frontend for Grapheme-to-Phoneme Conversion

Text-to-Speech (TTS) systems convert text into phonetic pronunciations which are then processed by Acoustic Models. TTS frontends typically include text processing, lexical lookup and Grapheme-to-Phoneme (g2p) conversion stages. This paper describes the design and implementation of the Indic frontend, which provides explicit support for many major Indian languages, along with a unified framewor...

Analysis of Phonetic Matching Approaches for Indic Languages

Phonetic matching plays an important role in multilingual information retrieval, where data is manipulated in multiple languages. Users need information in their local language, which may differ from the language in which the data is maintained. In such an environment, we need a system that matches strings phonetically, either exactly or approximately, irrespective of errors. There are m...

Publication date: 1994